IsoBase: a database of functionally related proteins across PPI networks
نویسندگان
چکیده
We describe IsoBase, a database identifying functionally related proteins, across five major eukaryotic model organisms: Saccharomyces cerevisiae, Drosophila melanogaster, Caenorhabditis elegans, Mus musculus and Homo Sapiens. Nearly all existing algorithms for orthology detection are based on sequence comparison. Although these have been successful in orthology prediction to some extent, we seek to go beyond these methods by the integration of sequence data and protein-protein interaction (PPI) networks to help in identifying true functionally related proteins. With that motivation, we introduce IsoBase, the first publicly available ortholog database that focuses on functionally related proteins. The groupings were computed using the IsoRankN algorithm that uses spectral methods to combine sequence and PPI data and produce clusters of functionally related proteins. These clusters compare favorably with those from existing approaches: proteins within an IsoBase cluster are more likely to share similar Gene Ontology (GO) annotation. A total of 48,120 proteins were clustered into 12,693 functionally related groups. The IsoBase database may be browsed for functionally related proteins across two or more species and may also be queried by accession numbers, species-specific identifiers, gene name or keyword. The database is freely available for download at http://isobase.csail.mit.edu/.
منابع مشابه
Introducing Potential Key Proteins and Pathways in Human Laryngeal Cancer: A System Biology Approach
The most common malignant neoplasm of the head and neck region is laryngeal cancerwhich presents a significant international health problem. The present study aims to screenpotential proteins related to laryngeal cancer by network analysis to further understandingdisease pathogenesis and biomarker discovery. Differentially expressed proteins were extractedfrom literatures of laryngeal cancer th...
متن کاملPPInfer : a Bioconductor package for inferring functionally related proteins using protein interaction networks
Interactions between proteins occur in many, if not most, biological processes. This fact has motivated the development of a variety of experimental methods for the identification of protein-protein interaction (PPI) networks. Leveraging PPI data available STRING database, we use network-based statistical learning methods to infer the putative functions of proteins from the known functions of n...
متن کاملIntroducing Potential Key Proteins and Pathways in Human Laryngeal Cancer: A System Biology Approach
The most common malignant neoplasm of the head and neck region is laryngeal cancerwhich presents a significant international health problem. The present study aims to screenpotential proteins related to laryngeal cancer by network analysis to further understandingdisease pathogenesis and biomarker discovery. Differentially expressed proteins were extractedfrom literatures of laryngeal cancer th...
متن کاملExploring Symmetric Substructures in Protein Interaction Networks for Pairwise Alignment
In molecular biology, comparison of multiple Protein Protein Interaction (PPI) networks to extract subnetworks that are conserved during evolution across di↵erent species is helpful for studying complex cellular machinery. Most e↵orts produce promising results in creating alignments that show large regions of biological or topological similarity between the PPI networks of various species, but ...
متن کاملSingling out functional similarities in graph databases
It has been shown that protein-protein interactions analysis may be useful to infer information about biological variations caused by evolution. All the known protein-protein interactions of a given organism may be modelled by a network, namely, the protein-protein interaction (PPI) network of that organism, stored in a graph database. The analysis and comparison of protein-protein interaction ...
متن کامل